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Abstract 

^ , We present an algorithm for computing Grobner bases of vanishing ideals of points that is optimized for 

■^Ih ' the case when the number of points in the associated variety is less than the number of indeterminates. The 

(-H I algorithm first identifies a set of essential variables, which reduces the time complexity with respect to the 



number of indeterminates, and then uses PLU decompositions to reduce the time complexity with respect to the 
number of points. This gives a theoretical upper bound for its time complexity that is an order of magnitude 
lower than the known one for the standard Buchberger-MoUer algorithm if the number of indeterminates is much 
larger than the number of points. Comparison of implementations of our algorithm and the standard Buchberger- 
MoUer algorithm in Macaulay 2 confirm the theoretically predicted speedup. This work is motivated by recent 
' t-^ , applications of Grobner bases to the problem of network reconstruction in molecular biology. 

1^^. Keywords: Grobner basis, vanishing ideal of points, zero-dimensional radical ideal, standard monomial, biological 

^+ . applications, run-time complexity. MSC: 13P10, 92C40. 

1 Introduction 

L^ ' Recently, Grobner bases have been proposed as a promising selection tool in applications to molecular biology 

^■"^ ' [71 [3]. In these applications, the data consists of m vectors of discretized concentration values in a finite field k 

for a network of n biochemicals. The data points can be viewed as an affine variety V with points in fc" of 

(^j ' multiplicity one and correspond to the vanishing ideal 'i.{V) of these points in the polynomial ring k[x\, . . . ,Xn\. 

r\ , Each variable Xi represents the i-th biochemical which takes on values in k. Typically, the number of data points 

jrt ' m = \V\ is on the order of tens, while the number of variables n may be in the thousands (for example, see 

[13]). This requires finding Grobner bases in situations were m <^ n, and the run-time of algorithms for this step 
constitutes a bottleneck for overall feasibility of these calculations. The primary motivation of this paper is to 
find an algorithm that optimizes run-time in the case when m <^ n. 

Several methods have been described and implemented for computing Grobner bases and the associated stan- 
dard monomials of vanishing ideals of points. In [TO], the authors presented the Buchberger-MoUer (BM) algorithm 
for computing the reduced Grobner basis of the ideal of a variety V over a field. The BM algorithm performs 
Gaussian elimination on a generalized Vandermonde matrix and its complexity is quadratic in the number of 
indeterminates and cubic in the number of points in 1/ [TJ |5] 1111 112) . Farr and Gao presented an algorithm based 
on a generalization of Newton interpolation [4]. While the complexity of their algorithm is exponential in the 
number n of indeterminates, the algorithm has been designed for the case in which n is small as compared to the 
number of points. Lederer proposed a method for lexicographic term orders which gives insight into the structure 
of the Grobner basis [8] . Cerlienco and Mureddu proposed a combinatorial method that uses Ferrers diagrams to 
compute the set of standard monomials for the vanishing ideal of a given set of points with respect to an inverse 
lexicographical order [2]. 

In [6], the present authors introduced a modification of BM specifically for the case when the number of 
points m in a given variety is less than the number of indeterminates n. The EssBM (for Essential Buchberger- 
MoUer) algorithm proposed in that paper identifies essential variables, that is, those in the support of the standard 



monomials associated to the ideal of the points, and computes the relations in the reduced Grobner basis in terms 
of these variables using BM. Since the standard monomials are in terms of at most m variables, the computation of 
a Grobner basis can be restricted to a proper subring of the underlying ring involving only the essential variables. 
EssBM was shown to have a worst-case complexity of 0{nm^ + m^), which is dominated by the first term when 
n S> m. 

Here we present an improvement of the EssBM algorithm in which we eliminate the use of BM altogether. This 
new algorithm, which we call EssGB (for Essential Grobner Bases), makes use of PLU decompositions providing 
an overall improvement in worst-case complexity to 0{nvn? + m*) for a fixed finite field. 

The remainder of our paper is organized as follows. In Section 2 we give a description of the algorithm and in 
Section 3 we provide the theoretical background for it. In Section 4 we estimate the worst-case time complexity 
of our algorithm. We conclude with a summary of the performance of an implementation of our algorithm in 
the computer algebra system Macaulay 2 on test data. These empirical tests confirm the theoretically predicted 
speedup relative to implementations of BM and EssBM on the same platform. 

2 The EssGB Algorithm 

Throughout this paper, let R = k[x-i, . . . ,a;„] denote a polynomial ring over a finite field k, and let -< be a fixed 
term order on R. For a = (ai, . . . , a„) £ Z^o, let a;" denote the monomial x\^ ■ ■ ■ x'^"- . 
Definition 2.1. The support of a monomial x"' G R is supp{x°-) = {xi : Xi\x°'}. 

This is not to be confused with the support of a polynomial f, denoted Supp{f), which is the set of monomials 
that occur in /. 

Let V C k" be a variety of points with multiplicity one and |V^| = m < oo. We consider the problem of 
computing the reduced Grobner basis of the vanishing ideal I(V') of the points in V with respect to -<. We call 
a Grobner basis G reduced if all generators are monic (leading coefficients are equal to 1) and for all g,h £ G, if 
g ^ h, then the leading term of g does not divide any monomial in Supp{h). We call a polynomial ,f £ R reduced 
with respect to G ii f is the normal form of a polynomial f' G R with respect to G, that is, / is the remainder 
of /' upon division by the elements of G. If the context is clear, we simply say that / is reduced. 

Let 7 C -R be an ideal and G a Grobner basis for / with respect to ^. For any f £ I, let LT{f) denote 
the leading term of / with respect to -< and tail{f) the polynomial / — LT{f). The ideal generated by the set 
{LT{g) : g £ G} is denoted by LT{G). Further, let SM{G) be the set of monomials not in LT{G). Note that 
SM{G) is a fc-vector space basis for R/I. We call SM{G) the set of standard monomials associated to G. In this 
paper, we restrict our attention to the case where I = I(V^) for a finite variety, that is, / is a zero-dimensional 
radical ideal, and SM{G) is a finite basis for R/I. 
Definition 2.2. A variable Xi is essential if Xi £ SM{G). 

Equivalently, Xi is essential if and only if there is a monomial x°' £ SM{G) such that Xi £ suppix'^). Let 
EV(G) denote the union of the supports of the standard monomials a;" £ SM{G). Note that LT{G), SM{G), 
and EV{G) depend only on the ideal 1{V) and the term order ^. Thus we can indicate this dependence by the 
notation chosen here. 

Let P = {pi, . . . ,ps} C A;" be a set of points. A polynomial f £ R is & separator of pi £ P ii f{pi) = 1 and 
f{j>j) = for all other pj £ P. Given a variety V of points of multiplicity one and a term order ■<, the EssGB 
algorithm returns the triple (G, SM{G), S), where G is the reduced Grobner basis of the ideal I(V^) of points in V 
with respect to ^; SM{G) is the set of standard monomials associated to G; and S is the set of reduced separators 
of the points in V . 

Initialize each set as follows: EVo — {} and 5* Mo — {lij}. Let [n] denote the set {1, . . . ,n} and for i £ [n], let 
EVi and SMi denote i-th approximations of the corresponding sets. 

For each i £ [n], do the following. Find the i-th smallest variable, say xt. Suppose there are r monomials 
i^i, . . . ,3;"'" in SMi-i. Note that these are fc- linearly independent. Try to write Xi as a fe- linear combination of 
these monomials. That is, find (if they exist) ci, . . . ,Cr £ k, where 



Xi{pi) -- 




Xi{p2) -- 


r 

^Y^c^x^Hp^) 


Xi{pm) = 


r 



(1) 



and x°'(pt) is the evaluation of a;" at the i-th point in V for t £ [m\. 

For solving the system (1) we wiU use a PLU decomposition Pi-iI/j-iC/j-i of the matrix Ai-i — (x^J (pt)) of 
the monomials x"'' G SMi-i evaluated at the points in V. This will reduce the time complexity at each step at 
which no new essential variable is added to EVi-i. Note that, in general, Ai^i will not be square, but will have 
dimensions m x r, where r = \SMi-i\ < m. Still, the standard Gaussian elimination procedure can be applied 
to find matrices Pi-i,Li-i,Ui-i whose product is Ai-i and such that Pi-i has dimensions m x m and undoes 
all row exchanges of the Gaussian elimination, Li-i is an 7n x tti lower triangular matrix with ones on the main 
diagonal, and Ui-i has dimensions m x r and is upper triangular in the sense that Uke = whenever k > £. Thus 
even if Ai-i is not square, it has a PLU decomposition in the above sense. 

If the system (1) has a solution, then it must be unique (see Lemma [3. 5 |l . If Cj — whenever Xi -< x"'\ then Xi 
is inessential and is the leading monomial of a polynomial in 'i.{V). In this case, EVi = EVi-\, SN'h = SMi-\, 
Ai = Ai-i, Pi = Pi-i, Li = Li_i, and Ui = Ui-i. 

If no solution exists or Cj ^ for some j with Xi -< x"' , then Xi is an essential variable and hence is a 
standard monomial. In this case let EVi = EVi-i U {xi}; compute the set SMi of standard monomials for the 
ideal 1{V) H k[EVi] of the points projected onto the variables in EVi (see Lemma l3.ip : and compute the PLU 
decomposition PiLiUi — Ai of the matrix Ai = {x"'^ {pt)) of the monomials x"'^ G SAdi evaluated at the points 
in V. 

At the end of the loop, all essential variables and standard monomials have been identified. The minimum set 
(with respect to inclusion) of generators x" of the leading term ideal of 1{V) is identified (see Lemma 13. 3p , and 
for each of these generators a polynomial a;" — g is computed so that the set of all of these polynomials forms a 
reduced Grobner basis for I(V). Finally, the set S of reduced separators is then computed by solving a system of 
linear equations. 

2.1 EssGB 

Let Af be an {m x n)-matrix with rows being the points of V, and -< a term order. We will assume that 
Xi ^ ■ ■ ■ ^ x„. 

Input: M; -< 

Output: {GB,SMn,S) where GB is the (reduced) Grobner basis for 1{V) with respect to -<, SM„ is the set of 
standard monomials for GB, and S is the set of reduced separators of the points in V. 

1. Initialize £Vo ■-{},SMo := {1r},GB := {},Ao — [1, . . . , 1]^ efc™,Po := Id{m),Uo — [1,0,..., 0]^ Gk"", 
and let Lo be the (m x 77i)-matrix that has ones in the first column and on the diagonal, and zeros elsewhere. 

2. FOR i e {l..n} do 

(a) Initialize Xi :— i-th smallest variable, r := I^Mi-il, and bi := i-th column of Af. 

(b) IF there is no solution c — [ci, . . . , Cr]^ to the system Pi^iLi-iUi-i ■ c = bi such that Cj = whenever 
Xi -< x°-^ 

THEN 
i. EVi ■- EV,-i U {xi}. 

ii. Compute SMi in fc[J5yi] using the algorithm SM-A. 

iii. Compute the matrix Ai := (x'^^ijpt)), for x°'^ G SMi and pt the point in row i of M. 
iv. Compute the PLU decomposition PiLiUi of Ai. 

3. Compute the set LT of generators of the leading term ideal of I(y) using the algorithm LT-A. 

4. FORj G {l..|iT|} do 

(a) Let bj — ix'^^ (pt)) be the (m x l)-vector of values of the monomial x'^^ G LT evaluated at the points pt 
in M. 



(b) Find a solution [ci, . . . , Cm] of P„LnUn ■ c — b 



(c) GB = GBU {x"^^ - E cex"'} where x"' G SM„. 

5. Compute the set 5* of reduced separators for M using the algorithm SP-A. 

6. RETURN GB, SM„, and S. 

2.2 Supporting algorithms 

This section contains the subroutines used in the main algorithm EssGB. 



2.2.1 SM-A 

The algorithm SM-A generates a set SMi of standard monomials for 1{V) fl k[EVi], given a newly identified 
essential variable Xi and the set SMi-i of standard monomials for I(V^) r]k[EVi~i]. It first constructs a sorted set 
of candidate monomials by forming all products x'^x'^ of monomials in SMi-i and powers of Xi, for < g < \k\. 
Then the monomials which are fc-linearly independent can be found by identifying the pivots of the evaluation 
matrix Ai :— {x°'^{pt)), where a;"' G C and pt is the point in row t of M. 

Input: Xi an essential variable; SMi-i. 

Output: SA'Ii the set of standard monomials for I(V) fl fc[£"l/i]. 

1. Compute the set C = {x'^x"' : a;" £ SMi^i, < q < \k\} of candidate standard monomials. 

2. Sort C so that C = {a;"!, . . . .a;''^ : s = \C\,x''^ -< aj^^+i for all j}. 

3. Compute the matrix A := (a;"^ (pt)), for x^J G C and pt the point in row t of M. 

4. Compute the row-echelon form U of A. 

5. Identify the columns 7r(l), . . . , n{r) corresponding to the r < s pivots of U. 

6. RETURN SMi = {x^'-w ,..., ar^-c-) }. 

2.2.2 LT-A 

This algorithm identifies all minimal leading terms a;" of I(V^). We use the following observation, which will be 
proved in the next section (Lemma 13. 3|l . 

Remark 2.3. The ideal LT{G) is generated by variables Xi ^ EV{G) and monomials x"" such that supp{x°') C 
EV(G), x" ^ SM{G), and x"' is minimal in the sense that no monomial in LT{G) divides x°' . 

Recall that SAf„ and EVv, are the sets of standard monomials and essential variables, respectively, after the 
execution of Step 2. We will assume that SMn and EV^ are sorted according to X. 

Input: SM^\ EV„. 

Output: LT the set of generators of the leading term ideal of liV). 

1. Initialize C := the (r x m)-matrix of ones, where r := |_El/„|,7n := IS'A/nl; LT :— {}. 

2. FORi G {l..r} do 

(a) FOR j G {l..m} do 

i. IF a;ix"J G SMn where Xi G EVn and x"J G SM„ 
ii. THEN C{i,j) :=0 
iu. ELSE FOR k G {l..j - 1} do 

A. IF C{i,k) == 1 AND (a-"j)%(a;"'') == 

B. THEN C{i,j) —0. 

3. FORi G {l..r} do 

(a) FOR j G {l..m} do 
i. IF C{i,j) == 1 
ii. THEN LT = LTU{xia-"j}. 

4. Remove repeated elements in LT. 

5. LT = LTD{x, ^EV„}. 

6. RETURN LT. 



2.2.3 SP-A 

The algorithm SP-A computes the separators of the points in 1/ = {pi, . . . ,Pm} in terms of the standard monomials 
associated to the ideal of the points. For each point pt, we wish to find a polynomial st{:K) = X]"Li Cja^"^ G 
k[xi, . . . , x„] that satisfies the following: 

m m 

VJ Cjx"^^ (pt) — 1; 2_] "^i^"^ (P^) = 0, for all ^ / t. 

We can do so by solving the system A^c = et, where An is the evaluation matrix An = {x"^^ {pe))t,je{i..7n} 
constructed during execution of EssGB, c = [ci, . . . ,Cm]'^ G fc™ is a vector of unknowns, and et is a standard 
column basis vector. 

Input: SMn — {x"^^ = 1, . . . , a;""*}, the set of standard monomials in increasing ^-order; {m x m)-matrix A„ in 

its PLU form P„LnUn- 

Output: S — {si(x), . . . , Sm.(x)} the set of reduced separators of the points in V. 

1. Initialize S = {}. 

2. FOR t e {l..?n} do 

(a) Compute c = [ci , . . . , Cm]'^ such that PnLnU„ ■ c — et- 

(b) 5' = 5U{s,(x):=Er=iC,^"0- 

3. RETURN S. 

3 Theoretical Background 
3.1 SM-A 

Recall that the SM-A algorithm computes a sorted list C — {x"^^ , ■ ■ ■ , x'^" } of candidate monomials and returns 
SMi — {a;""'!) , . . . , a;"'^'"') } C C, where 7r(l), . . . , 7r(r) refer to the columns of the row echelon form of A corre- 
sponding to pivots and A :— {x""^ (pt)) is the evaluation matrix computed in Step 3 of the subroutine. 

Lemma 3.1. Let SMi he the output returned by the SM-A subroutine, given an essential variable Xi and the set 
SMi-i of standard monomials for I{V)r]k[EVi-i]. Then Sh'U is the set of standard monomials for\{y)r\k[EVi\, 
where EVi = EVi-i U {xi}. 

Proof. The set C consists of all multiples of a;^ and a;" £ SAh-i and so generates the fc-vector space R/Ir]k[EVi]. 
The dimension of this space is equal to the number r of nonzero rows of the matrix U as computed in Step (4) of 
SM-A. 

Now consider x"''' G C that is not in the set SA'h returned by SM-A, and let U{k) consist of the first k columns 
of U. Then U{k — 1) and U{k) have the same rank and it follows that the fc-th column of A is a linear combination 
of the columns of A indexed j = l,...,fc — 1. This means that 

x^--J2<=j=^''' <^W), (2) 

for some coefficients Cj. Since the elements of C were listed in increasing order with respect to -<, the monomial x'^'' 
is the leading monomial in ([5} and therefore cannot be a standard monomial. Since there must be r standard 
monomials for 1{V) D k[EVi], these must by default be the monomials returned in Step (6) of SM-A. D 

Remark 3.2. At the end of Step 2 of EssGB, the set SM„ is indeed the set of standard monomials forI{V)r]k[EVn] 
(Corollary 2 in fS^). Furthermore, it is the set of standard monomials for 1{V) with respect to -< (Theorem 7 
inJBi)- 



3.2 LT-A 

Let LT be the output returned by LT-A and let EV„ and SA'In be the sets of essential variables and the standard 
monomials SMn as computed in Step 2 of EssGB. Define B to be the set B = {xi : Xi ^ EVn} U {a;" : supp{x'^) C 
EVn,x'^ ^ SMnjX"^ minimal}, where minimal means no monomial in LT divides x"" . 

Lemma 3.3. Let G be a Grobner basis for I(V^). Then the leading term ideal LT{G) is generated by B. 

Proof. Since the sets of leading terms and of standard monomials for an ideal are mutually exclusive, by definition 
B C LT{G). Let a;" G LT{G). If suppix") ^ EV„, then there is Xi € B that divides a;". Now suppose 
supp{x"^) C EVn. Clearly a;" ^ SM„. Since there are a finite number of divisors of a;", there is a; £ B that 
divides a;". Hence, B generates LT{G). D 

Note that B represents the minimum set (with respect to inclusion) of generators for LT{G). In particular, 
no monomial x" £ B divides any other monomial in B. Furthermore, the set LT returned by LT-A is the set B. 

3.3 SP-A 

We know that separators exist (see Corollary 2.14 in [12]). We also know that separators have a canonical form. 

Lemma 3.4. Let P C fc" be a set of points, ^ a term order, and G a Grobner basis ofI{P) with respect to -<. 
The reduced separators of the points in P can be written uniquely in terms of the standard monomials in SAI{G). 

Proof. Let / be a separator of a point in P. Since there is p £ P such that f{p) = 1, then / ^ J-{P)- Hence / 
is a nonzero element of R/I{P). As R/I{P) is generated (as a fc- vector space) by SM{G), then / has a unique 
fc-linear representation in terms of the standard monomials which is reduced with respect to G. D 

3.4 EssGB 

Lemma 3.5. For alii, the system Pi-^iLi-iUi-i-c — hi obtained during Step 2(b) of the execution of the algorithm, 
EssGB has at most one solution. 

Proof. Recall that Ai-\ := Pt-iLi-iUi-i is the (?n x r)-matrix {x'^' (pt)) where the monomials a;"^, . . . ,a;'''" £ 
SMi-i and pt is the point in row t of M. Since the monomials x""^ , ■ ■ ■ , a;""" are chosen to be linearly independent, 
the rank of Ai-i is r. Hence Ai-i has a trivial null space. D 

Recall that the LT-A algorithm returns the minimum set LT = B of generators for the leading term ideal 
of I(V). 

Lemma 3.6. A finite set G C i{V) is the reduced Grobner basis of 1{V) with respect to -< if and only if 

1. G is monic. 

2. {LT{g) ■.g£G} = B and 

3. Supp{tail{g)) C SMn for every g £ G. 



Proof. Let / — I(V). If G is the reduced Grobner basis for /, then (1) holds by definition. By Lemma 13.31 
B C {LT{g) : g G G}. On the other hand, we cannot have different g,h £ G with the leading term of g dividing 
the leading term of h. Therefore {LT{g) : g £ G} must be equal to the minimum set B of its generators, and (2) 
holds. Moreover, if a;" ^ SM„, then there must be some x'' £ B that divides a;", and hence a;" cannot be in 
Supp{tail{g)) for any g £ G, which is equivalent to condition (3). 

Now let G C I be a finite set that satisfies (l)-(3). Let H be any Grobner basis, and let / £ 1{V). Then 
the leading monomial of some h £ H divides LT{f), and by Lemma 13.31 some x^ £ B divides LT{f). Now (2) 
implies that LT{g) divides LT{f) for some g £ G. Thus G is a Grobner basis and is monic by (1). 

Finally, let g, h be different elements of G. Then LT{g) does not divide LT{h) by minimality of B. Moreover, 
LT{g) cannot divide any monomial in Supp{tail[{h)) , since by (3) the latter monomials are standard monomials, 
while LT[g) is not in SMn. D 

Theorem 3.7. Let {G,SMn,S) be the output returned by the EssGB algorithm, given a variety V and a term 
order -<. Then G is the reduced Grobner basis of 1{V) with respect to -<, SMn is the set of standard monomials 
associated to G, and S is the set of reduced separators of the points in V . 



Proof. This follows from Remark 13.21 and Lemmas 13.41 and 13.61 D 



The algorithm EssGB can be simphfied for lexicographical orders. Specifically once a monomial has been 
identified as a standard monomial in Step i of EssGB, then it continues to be a standard monomial in subsequent 
iterations. This property can be used to simplify the algorithm SM-A for the case of lexicographical orders. 
However, the simplification would not reduce the order of magnitude of our worst-case run-time estimate, and we 
did not implement it. 

4 Complexity of the Algorithms 

4.1 Complexity of SM-A 

Let p = |fc|. There are 0{pm) candidate monomials, which require 0{pm^ log{pm)) steps to sort, assuming that 
comparison of two exponents is an operation of cost 0{m). The matrix Ai has 0{m -pm) entries. Computing the 
row-echelon form of A has time complexity 0(pm ■ m?). Identification of the columns with pivots is an 0{m ■ pm) 
operation. Hence the worst-case complexity of SM-A is 

0{pm + pm log{pm,) + pm + pm ) — 0{pm, log{pm) + pm ). 

4.2 Complexity of LT-A 

As there are at most m^ candidate monomials, initialization of the matrix C requires 0{m,^) operations. The 
FOR loop in Step 2 is executed 0{m) times, similarly for the FOR loop in Step 2(a). Checking for membership of 
SMn in the IF clause of Step 2(a)(i) requires 0{m) operations. Checking for divisibility in Step 2(a)(iii) can be 
implemented by using a look-up table, and can be presumed to have a constant cost in each iteration of 2(a) (iii), 
while creating the look-up table requires a one-time cost of 0{m'^). In all, the cost associated to Step 2 is 0(771"^). 
Step 3 requires 0{m?) computations, while Step 4 requires 0(7Ti'^ log?n) computations. The last step requires 
0{n) computations as there are at most n — m inessential variables. Overall the complexity of LT-A is 

0(771 -f 771 -f 771 + m log 777, -|- 77) = Oijl + m ). 

4.3 Complexity of SP-A 

Initialization of the set S is a constant operation. For the FOR loop, since we are using the FLU decomposition of 
the matrix A, solving each of the 771 systems in 2(a) requires 0{m^) steps for forward and backward substitution. 
Maintenance of the set S in 2(b) requires m scalar multiplications. Hence, the complexity of the SP-A algorithm 
is 0(777)0(777^ + 777) = 0(777^). 

4.4 Complexity of EssGB 

Initialization has cost 0(771). In the main FOR loop (Step 2), the IF statement assumes that we have a linear system 
in PLU form and so requires 0{m?) operations for solving the system using forward and backward substitutions. 
Given no solution (entering the THEN clause), to compute the new set of standard monomials is 0(pm^ log{pm,) + 
pm^). Construction of the matrix At requires 0{m'^) operations since the numbers of its rows and columns are 
both bounded above by m, and another 0(777"^) to compute its PLU decomposition. Since there are at most 777 
essential variables, the THEN clause will only be executed 0(777) times, resulting in 

0(77)0(777 ) -f 0(777)0(777 + pm, [og{pm,)+pm, + m -I- 777 ) — 0{nm -\- pm log{pm)+pm, ) 

as the total cost for Step 2. 

Executing Step 3 is 0{n + mi?), as derived above. Construction of hj in Step 4(a) requires 0(777) operations, 
while solving the system in 4(b) requires 0[m?) operations each for forward and backward substitution. Appending 
to the list GB is an Oim) operation. Since there are at most 77 -I- 777^ leading terms, the total cost of Step 4 is 
0(77 + 777^)0(777^) = 0(77777^ + m^). 

Executing Step 5 is 0(777"^), as derived above. Thus the worst-case complexity of the EssGB algorithm is 

0(7r7) -f 0(777r7 + pm, log{pm)+pm, ) + 0(n + m, ) + 0(nm, +m ) + 0{m ) 

= O (p777 log(p777) +P777 + 77777 ) . 

If we assume p to be fixed, then the complexity can be reduced to 0(77777^ + 777''). For the applications to biological 
data where 77 S> 777, the complexity is dominated by 0{nm^). 



5 Performance of the EssGB Algorithm 

We compared the run-times of the algorithms EssGB, EssBM, and BM on randomly generated varieties contain- 
ing 771 points in fc", where A: is a finite field of the form Z/pZ. We performed this comparison in Macaulay 2, 
version 0.9.97, where each algorithm has been implemented. 

We generated r — 10 afBne varieties for changing values of p, n, and m. Since the algorithms require specifi- 
cation of a term order, we consider this to be parameter as well. The table below lists the values we used for this 
comparative study. 



Parameters 


Values 


p = cardinality of k 


{5, 101} 


n = number of variables 


{100, 200, 300} 


m = number of points 


{5, 10, 15} 


-< = term order 


{Lex, GRevLex} with default variable order 



For 1 < i < r, the i-th variety consists of nr{i) randomly generated points, where 

,.. ... 'r -i + 1' 
nrii) 

The remaining m — nr(i) points were generated using random homogenous linear polynomials gi, . . . ,gm-nr(i), 
where gj £ k[yi, . . . ,ynr{i)]- To generate the j'-th new point pj, the coordinates of Pj are computed individually; 
that is, for 1 < £ < n 

Pol ■= 9j {pu,---,Pnr(i)i) ■ 

Note that for i = 1,2, all points are randomly generated. This will result, with probability very close to one, 
in a variety where the points are in general position, that is, there are no linear dependencies among the points 
(Ell Pg- 7). In the runs for i = 3, . . . , 10, the enforced randomly chosen linear dependencies ensure that the linear 
span of the generated variety will have dimension < nr{i) (and equal to nr{i) with probability close to one). This 
choice of test data allowed us to compare run-times of the three algorithms on ideals of varieties with different 
geometric properties. 

We applied the three algorithms to each of the generated varieties. The run-time results are displayed in 
Figures [T] and O 



6 Discussion 

Recently, Grobner bases have been used as a selection tool in applications to molecular biology [T] |3] . In these 
apphcations, the number of data points m tends to be significantly smaller than the number of variables n. The 
computation of Grobner bases constitutes a bottleneck for overall feasibility of these calculations. The primary 
motivation for our paper was to find an algorithm that optimizes run-time in the case when m <ti n. 

The time complexity of the standard BM algorithm has been reported in the literature as quadratic in the 
number of variables n and cubic in the number of points m [10]. This makes it too slow for the applications 
mentioned in the preceding paragraph. In [6], we developed an algorithm EssBM that has a provable worst-case 
time complexity of 0{nm^ + m^) for a fixed finite field k. For the algorithm EssGB presented here, we can 
improve this worst-case estimate to 0{nm^ +m'^) for a fixed finite field k. The reduction from quadratic to linear 
scaling in the run-time was achieved in both EssBM and EssGB by first identifying the set of essential variables 
in a single loop of length n, and performing the most expensive steps of the computation only for these essential 
variables. While EssBM still uses BM as a subroutine on the reduced set of variables, EssGB eliminates calls 
to BM altogether and computes all relevant objects by solving systems of fc-linear equations. The coefficient 
matrices used in these equations change only when a new essential variable is encountered. This allows us to use 
PLU decompositions to reduce the cost to 0{m^) in all but m of the n steps of the main loop, and our overall 
worst-case estimate follows. 

Based on this estimate, one would expect our algorithm to be significantly faster than both BM and EssGB 
when m <^ n. We tested this prediction for randomly generated varieties, with \V\ = m £ {5, 10, 15}. We tested 
the algorithm on varieties that were generated totally randomly, which should ensure that the points will almost 
certainly be in general position, and on varieties where an increasing number of the points were expressed as 
linear combinations of previously defined random points. In order to ensure that we have enough different linear 
combinations of two points, the smallest field for which we tested our algorithm was Z/5Z. We also run tests for 
the rather large field Z/IOIZ. 



Our test runs neatly confirm that our algorithm EssGB has comparable performance with BM when n — 100, 
and significantly outperforms the latter when n — 300. The single exception are the simulations where m = 15, 
p = 101, and a GrevLex term order -< was used. In these simulations the performance of our algorithm becomes 
only comparable to that of BM when n = 300. However, the general pattern still holds: The more variables, the 
better EssGB performs relative to BM. 

We also observed that in general the run-times of EssGB are more consistent for different varieties under 
the same parameter settings than those for BM or EssBM. The only exception here are the experiments with 
m = 10, 15, p — 101, and a Lex term order -<, where similar magnitudes of run-time fiuctuations were observed 
for all three algorithms. The experiments with GrevLex term orders and p = 101 also show a significant decrease 
of the run-time of EssGB when the number of linear dependencies among the points in the variety increases. 

Our simulations do not in general show an advantage of our previous algorithm EssBM over BM, although 
EssBM clearly does become more competitive with BM as the number n of variables increases. Previous experi- 
ments reported in [6] had shown that EssBM outperforms BM when the number of variables starts exceeding 200. 
However, these experiments were run in implementation 0.9.8 of Macaulay 2, while the simulations presented here 
were run on version 0.9.97. We noticed a significant speedup of the run-times for both BM and EssBM between 
both versions; it was relatively larger for BM. 

In summary, both our theoretical run-time estimates and the test runs reported here indicate that EssGB 
would be the algorithm of choice if Grobner bases are to be found for a variety V in k[xi, . . . ,x„] such that 
|y| = m < n. 
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Figure 1: Run-times for the algorithms BM, EssBM, and EssGB for p e {5,101}, m, G {5,10,15}, and 
n e {100, 200, 300} with a defauh Lex order. 
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Figure 2: Run-times for the algorithms BM, EssBM, and EssGB for p e {5,101}, m, G {5,10,15}, and 
n e {100, 200, 300} with a defauh GRevLex order. 
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